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WHAT IS CLAIMED IS: 

1 1 . A method for computing a diversity measure for a predetermined combinatorial 

2 structure C having n elements, the method comprising steps of: 

3 (a) identifying M substructures c^ through each having m elements from among the n 

4 elements of the predetermined combinatorial structure C, where M equals n! / [(n-m)! m!]; 

5 (b) for each suWtructure for i from 1 to M, determining a number n^ of the M 

6 substructures Ci throiagh c^ that are similar to the substructure q; and 

7 (c) computing a fira entropy 0(m) based upon all the numbers n^ computed during step 

8 (b) and based upon M in\omputed step (a); 

12. A method as in claim V further comprising the steps of: 

2 (d) repeating steps (a) and (b) with m+1 substituted for m; 

3 (e) computing a second entroW 0(m+l) based upon all the numbers nj and M computed 

4 during step (d); and \ 

5 (f) subtracting the second entropyv<J>(m+l) from the first entropy 0(m) to produce the 

6 diversity measure. \ 

\l 3. A method as in claim 2, wherein steps (c) and (e) comprise the steps of: 
for each i from 1 to M: \ 

3 computing a fraction by dividir^ n^ by M; and 

4 computing a logarithm of fraction 

5 computing a sum by adding all logarithms orfractions Fi for i from I to M; and 

6 dividing the sum by M. \ 

14. A method as in claim 2, wherein step (b) comprise^he steps of, for each substructure 

2 Cj for i from 1 to M: \ 

3 for each substructure Cj for j from 1 to M: \ 

4 computing a distemce function d(q,Cj) representing measure of a diflTerence 

5 between substructure q and substructure Cj; \ 

6 comparing the distance function d(Ci,Cj) to a thresholdXand 
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7 V determining the substructures C; and Cj to be similar if and only if the distance 

8 \ function d(Ci,Cj) is less than the threshold. 

1 5. A method as in claim 2, wherein steps (c) and (e) comprise the steps of: 

2 for eabh distinct substructure c^: 

3 cismiputing a frequency ^ by dividing n^ by M; 

4 cornputing a logarithm of frequency ^; and 

5 computing a product by multiplying the frequency ^ and the logarithm of 

6 frequency 1^; and \ 

7 computing a sum by adding all products of the frequencies ^ and the logarithms of 

8 frequencies ^, \ 

16. A method as in claim 2, wierein step (b) comprises the steps of: 

2 for each substructure q for i from l\to M: 

3 monotonically renumbering niselements of q from 1 to m; and 

4 \^ for each substructure Cj for j from 1 to M: 

5 n monotonically renumberingsin elements of Cj from 1 to m; and 

6 v>/ determining the substructuresypi and Cj to be similar if and only if they are 
identical. \ 

17. A method as in claim 2, wherein step (b) comprises the steps of: 

2 for each substructure Ci for i from 1 to M: \ 

3 monotonically renumbering m elements of Cj from 1 to m; and 

4 for each substructure Cj for j from 1 to M: \ 

5 monotonically renumbering m elements of CjVom 1 to m; and 

6 determining the substructures q and Cj to be siMlar if and only if they are 

7 identical or isomorphic. \ 

1 8. A method as in claim 2, wherein steps (c) and (e) comprise tnfe steps of: 

2 for each distinct substructure q: \ 

3 computing a frequency £j by dividing n^ by M; \ 
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4 computing a quotient by dividing the frequency ^ by an expected frequency p;; 

5 \ computing a logarithm of quotient q^; and 

6 oomputing a product by multiplying the frequency ^ and the logarithm of 

7 quotient q^; and \ 

8 computing aysum by adding all products of the frequencies ^ and the logarithms of 

9 quotients q^. \ 

1 9. A method as in clami 2, wherein the predetermined combinational structure C 

2 comprises a linked graph, wherein the n elements comprise n nodes. 

1 10. A computer readable storage medium, comprising: 

2 computer readable program code embodied on said computer readable storage 

3 medium, said computer readable prognwn code for programming a computer to perform a 

4 method for computing a diversity measur^e for a predetermined combinatorial structure C 

5 having n elements, the method comprisingNsteps of: 

6 (a) identifying M substructures c^ throuMi c^ each having m elements from among the n 

7 elements of the predetermined combinatorial structure C, where M equals n! / [(n-m)! m!]; 

8 (b) for each substructure c^, for i from 1 to M, determining a number n^ of the M 

9 substructures Cj through c^ that are similar to the>substructure q; and 

10 (c) . computing a first entropy 0(m) based uponVll the numbers computed during step 

11 (b) and based upon M in computed step (a); \ 

12 \ 

13 11. A computer readable storage medium as in claim\o, the method further comprising 

14 the steps of: \ 

15 (d) repeating steps (a) and (b) with m+l substituted for m; 

16 (e) computing a second entropy 0(m+l) based upon all tti^^numbers n^ and M computed 

17 during step (d); and \ 

18 (f) subtracting the second entropy 0(m+l) from the first entropy 0(m) to produce the 

1 9 diversity measure. ^ 
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1 12. A computer readible storage medium as in claim 1 1, wherein steps (c) and (e) 

2 comprise the steps of: \ 

3 for each i from 1 to M: 

4 computing a^fraction by dividing n^ by M; and 

5 computing a ibgarithm of fraction Fj; 

6 computing a sum by aMing all logarithms of fractions F; for i from 1 to M; and 

7 dividing the sum by M. \ ' 

1 13. A computer readable storage medium as in claim 1 1, wherein step (b) comprises the 

2 steps of, for each substructure Cj foAi from 1 to M: 

3 for each substructure Cj for j from 1 to M: 

4 computing a distance mnction d(q,Cj) representing a measure of a difference 

5 ^ between substructure Cj and substructure Cj; 

dTV ^ comparing the distance ftinction d(Ci,Cj) to a threshold; and 

7 \\ UK/ determining the substrucmres q and Cj to be similar if and only if the distance 

^lJ^ ftinction d(Ci,Cj) is less than the threshold. 

1 14. A computer readable storage medium as in claim 1 1, wherein steps (c) and (e) 

2 comprise the steps of: \ 

3 for each distinct substructure q: \ 

4 computing a frequency ^ by dividing by M; 

5 computing a logarithm of frequency ^; and 

6 computing a product by multiplying the frequency ^ and the logarithm of 

7 frequency ^; and \ 

8 computing a sum by adding all products of the frequencies ^ and the logarithms of 

9 frequencies ^. \ 

1 15. A computer readable storage medium as in c\aim 11, wherein step (b) comprises the 

2 steps of: \ 

3 for each substructure q for i from 1 to M: \ 

4 monotonically renumbering m elements of C; from 1 to m; and 
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5 for each substructure Cj for j from 1 to M: 

6 monotonically renumbering m elements of Cj from 1 to m; and 

7 determining the substructures C; and Cj to be similar if and only if they are 

8 identical. \ 

1 16. A computer readable storage medium as in claim 1 1, wherein step (b) comprises the 

2 steps of \ 

3 for each substructure Cj for i from 1 to M: 

4 monotonically renumbmng m elements of q from 1 to m; and 

5 for each substructure Cj fbr j from 1 to M: 

6 monotonically renumbering m elements of Cj from 1 to m; and 

T\ \^ determining the subknictures c, and Cj to be similar if and only if they are 

1^ \ 

1 17. A computer readable storage medium as in claim 1 1, wherein steps (c) and (e) 

2 comprise the steps of \ 

3 for each distinct substructure q: \ 

4 computing a frequency ^ by ^dividing by M; 

5 computing a quotient by divid^g the frequency ^ by an expected frequency p^; 

6 computing a logarithm of quotfent and 

7 computing a product by multiplymg the frequency ^ and the logarithm of 

8 quotient q;; and \ 

9 computing a sum by adding all products ofUhe frequencies ^ and the logarithms of 
10 quotients q^. \ 

1 18. A computer readable storage medium as in claim 1 1, wherein the predetermined 

2 combinational structure C comprises a linked graph, ^Mherein the n elements comprise n nodes. 

1 19. A computer system, comprising: \ 

2 a processor; and "A 
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3 a processor readable storage medium coupled to the processor having processor 

4 readable program code embodied on said processor readable storage medium, said processor 

5 readable program code for programming the computer system to perform a method for 

6 computing a diveVsity measure for a predetermined combinatorial structure C having n 

7 elements, the method comprising steps of: 

8 (a) identifying Ml substructures Ci through c^ each having m elements from among the n 

9 elements of the predettermined combinatorial structure C, where M equals n! / [(n-m)! m!]; 

10 (b) for each substructure c^, for i from 1 to M, determining a number n^ of the M 

1 1 substructures c^ through c^^ that are similar to the substructure q; and 

12 (c) computing a first emropy 0(m) based upon all the numbers computed during step 

13 (b) and based upon M in conrmuted step (a); 

1 20, A computer system as in okim 19, the method fiirther comprising the steps of 

2 (d) repeating steps (a) and (b) vAth m+1 substituted for m; 

3 (e) computing a second entropy ^(m+1) based upon all the numbers and M computed 

4 during step (d); and \ 

sV. (f) subtracting the second entropy ^m+1) from the first entropy <&(m) to produce the 
diversity measure, \ 

1 21 . A computer system as in claim 20, wherein steps (c) and (e) comprise the steps of: 

2 for each i from 1 to M: \ 

3 computing a fraction Fj by dividingSnj by M; and 

4 computing a logarithm of fraction fA 

5 computing a sum by adding all logarithms of fractions Fj for i from 1 to M; and 

6 dividing the sum by M. \ 

1 22. A computer system as in claim 20, wherein step (b) comprises the steps of, for each 

2 substructure q for i from 1 to M: \ 

3 for each substructure Cj for j from 1 to M: \ 

4 computing a distance function d(Ci,Cj) representing a measure of a difference 

5 between substructure C; and substructure Cj; \ 
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comparing the distance function d(Ci,Cj) to a threshold; and 
determinihg the substructures q and Cj to be similar if and only if the distance 
function d(Ci,Cj) isVess than the threshold. 

23. A computer system as in claim 20, wherein steps (c) and (e) comprise the steps of 
for each distinct substructure q: 

computing a fi-e^uency ^ by dividing ni by M; 
computing a log^thm of frequency i^; and 

computing a product by multiplying the frequency ^ and the logarithm of 
frequency ^; and 

computing a sum by adding ^^1 products of the frequencies ^ and the logarithms of 
frequencies ^. 

24. A computer system as in claim 20^ wherein step (b) comprises the steps of 
for each substructure q for i from 1 to M: 

monotonically renumbering m elem^ts of q from 1 to m; and 

for each substructure C: for j from 1 t^ M: 

\ 

monotonically renumbering m elements of Cj from 1 to m; and 
determining the substructures qVnd Cj to be similar if and only if they are 
identical. 

25. A computer system as in claim 20, wherein yep (b) comprises the steps of 
for each substructure q for i from 1 to M: 

monotonically renumbering m elements of q ^m 1 to m; and 

for each substructure q for j from 1 to M: 

monotonically renumbering m elements o^q from 1 to m; and 
determining the substructures q and q to bd similar if and only if they are 

identical or isomorphic. 

26. A computer system as in claim 20, wherein steps (c) an\l (e) comprise the steps of 
for each distinct substructure q: 
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3 computing a frequency ^ by dividing n; by M; 

^ y-— ^ computing a quotient by dividing the frequency ^ by an expected frequency p;; 

5 ^""""---.....^computing a logarithm of quotient q^; and 

6 compiItif4g.^aproduct by multiplying the frequency ^ and the logarithm of 

7 quotient q,; and ^^^^--^^^^ 

8 computing a sum by adding all pro^Tlet^^the frequencies f; and the logarithms of 
quotients q^. 

1 27. A computer system as in claim 20, wherein the predeterminedcbinbinational structure 

2 C comprises a linked graph, wherein the n elements comprise n nodes. 
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